T 


























ECHNOLOGY,INC. 


Features 


Based on ROSS’ fourth-generation 
hyperSPARC™ processor 


Each module consists of two complete 
SPARC CPUs 


Each processor includes 
— RT620C Central Processing Unit 
(CPU) 


— RT626 Cache Controller, Memory 
Management, and Tag Unit 
(CMTU) 

— Four (512-Kbyte) or eight 
(1-Mbyte) RT628 Cache Data Units 
(CDUs) 
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hyperSPARC™ Dual-CPU Module 


— Hardware support for symmetric, 
shared-memory multiprocessing 


— Level 2 MBus support for cache 
coherency 


SPARC compliant 


— SPARC Instruction Set Architec- 
ture (ISA) Version 8 compliant 


— Conforms to SPARC Reference 
MMU Architecture 


— Conforms to SPARC Level 2 MBus 
Module Specification (Revision 1.2) 


Dual-clock architecture 


— Zero-wait-state, 512-Kbyte or 
1-Mbyte 2nd-level cache 


— Demand-paged virtual memory 
management 


Module design 


— MBus-standard form factor: 3.30” 
(8.34 cm) x 5.78” (14.67 cm) 


— Provides CPU upgrade path at 
module level 


— Advanced packaging technology 
for a compact design 


High performance * 


Intra-Module' Bus incorporates 10 — CPU scaleable up to 150 MHz — 178 SPECint92 (per CPU) 

= = U us 1 WwW 

voltage logic to reduce power and — MBus scaleable up to 50 MHz — 209 SPECfp92 (per CPU) 
increase speed Each hyperSPARC processor features * in a 50MHz MBus system 


— Dual-level caches 
Full multiprocessing implementation 


RT620C 
CPU 


Selection Guide 


— Superscalar SPARC CPU with inte- 
grated floating point unit and 
8-Kbyte instruction cache 


RT620C 
CPU 


Figure 1. Logic Block Diagram 


MBus (Level 2) 


























Part Number: RS4026* -150/512 
CPU Operating Frequency (MHz) 150 
Typical Power Consumption (w)** 52 
Second-level Cache Size (per CPU) 512K 
SPECint92 / SPECfp92 178 / 209 
SPECrateint92 / SPECratefp92 7721 / 8769 
(dual processor) 

SPECrateint92 / SPECratefp92 

(quad processor) 











*See Appendix C for hyperSPARC ordering information ** Commercial 


e ROSS Technology, Inc. ¢ 5316 Hwy. 290 West e Austin, Texas 78735 e TEL (512) 436-2000 e FAX (512) 892-3036 
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Functional Description 


The RS4026 hyperSPARC Module is a complete dual- 
SPARC CPU, including on-board primary and secondary 
cache memories. It is packaged as a compact PCB and 
interfaces to the remainder of the system via a SPARC-stan- 
dard MBus connector. Each of the two CPUs on the RS4026 
consists of a high-speed superscalar, highly pipelined integer 
processor with an on-chip floating-point unit (RT620), a 
Cache Controller, Memory Management, and Tag Unit 
(RT625), and four or eight Cache Data Units (RT628). The 
RS4026 fits within the clearance envelope for MBus modules 
per the SPARC MBus Specification. 


The RS4026 interfaces to the rest of the system via the 
SPARC MBus and conforms to the SPARC Reference MMU. 
This standardization allows the RS4026 to be interchange- 
able with other SPARC MBus-based CPU modules without 
having to modify any portion of the memory system or I/O. 
This CPU “building block” strategy not only decreases the 
user’s time to market, but provides a mechanism for upgrad- 
ing in the field. 


Component Overview 


Superscalar SPARC Processor (RT620C) 


The RT620C Central Processing Unit is the heart of ROSS’ 
fourth-generation of microprocessor. The RT620C CPU ar- 


Special Registers Integer Registers (REGS) 


Program lf Global Decoder 
Counter 


i Dependency Checker 
Unit Schedule and Control 


Exception Logic 


ISCHED 


Fetch 
Controller 


Instruction Address 
jg insiwetion | 
Instruction I Instruction Cache 
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chitecture employs two advanced concepts for increasing 
computer system performance: superscalability and superpi- 
pelining. 

The RT620C is a high performance full-custom CMOS im- 
plementation of integrated SPARC integer and floating-point 
logic, with an on-chip cache for instructions. 


Advanced architecture and manufacturing technologies give 
the RT620C ultra high performance without requiring soft- 
ware recompilation. Figure 2 is a logic block diagram of the 
RT620C. 


IDP. The Integer Data Path comprises several units. Two 
independent Arithmetic and Logic Units (ALUs) handle inte- 
ger arithmetic, logical, and shift instructions. The Load/Store 
Unit (LSU) handles instructions that load and store data be- 
tween memory and registers which includes the loading and 
storing of both integer and floating-point data. The Special 
Register Unit (SRU) handles instructions that read and write 
the SPARC Special Registers (SREGS). The Integer Register 
File (IREGS) is also contained in the IDP. 


FPDP. The Floating-Point Data Path also comprises several 
units. These are the Floating-Point Queue (FPQ), the Float- 
ing-Point Arithmetic Unit (FAU), The Floating-Point Multi- 
plier Unit (FMU), the Floating-Point Register File (FREGS), 


Floating Point Registers (FREGS) 


Floating Floating 
Point Point 
Scheduler 


Intramodule 


Bus 


Interface 


Instruction 


Figure 2. RT620C CPU Logic Block Diagram 
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and the Floating-Point Status Register (FSR). These floating- 
point units handle all SPARC floating-point instructions. 


ISCHED. The Integer Scheduler performs key control func- 
tions. It provides global instruction decodes to identify which 
execution unit resources are required, and determines wheth- 
er sequential or simultaneous execution is possible. 


The ISCHED also determines whether data forwarding can 
be performed and whether instruction dispatches (also called 
“Jaunches”) need to be delayed due to data dependencies. 
The ISCHED initiates instruction launch and identifies and 
controls interrupt and trap handling. 


FPSCHED. The Floating-Point Instruction Scheduler per- 
forms key control functions for the floating-point unit. When 
the Integer Unit detects floating-point instructions in the 
decode stage, it offloads these instructions to the floating- 
point functional units and continues processing. Therefore, 
functional blocks exist that perform necessary decode, sched- 
uling, and control for the floating-point operations. 


The FPSCHED performs floating-point instruction decoding, 
resolves floating-point data dependency and data-forwarding 
conditions, and provides the ISCHED with floating-point 
execution status. Delayed instructions are stored temporarily 
in the Floating-Point Instruction Queue (FPQ). Instructions 
are launched from the FPQ as data dependencies are 
resolved. 


IFETCH. The Instruction Fetch Unit consists of two major 
functional blocks referred to as the Program Counter Unit 
(PCU) and the Instruction Fetch Controller (FETCHC). 


The PCU calculates the address of the next instruction to be 
fetched. It handles instructions that cause program control 
transfer, such as CALL and BRANCH. This unit handles both 
integer and floating-point branch instructions. 


The IFETCHC fetches two instructions at a time, and in each 
clock cycle, the CPU attempts to launch both at once. 


ICACHE. The on-chip instruction cache is organized as a 
two way set associative buffer. The ICACHE stores 8 Kbytes 
of instructions. Its inclusion follows the Harvard architecture 
approach, reducing bus contention during memory accesses. 
The ICACHE has a high-performance, one-wait-state cache 
miss penalty. 


IBIU. The Intra-Module Bus Interface Unit provides the in- 
terface between the RT620C CPU and the external world. 
The IBIU samples incoming control signals and propagates 
controls to appropriate functional blocks. The IBIU is respon- 
sible for generating memory access control signals to the 
cache memory subsystem. Data and instructions are read 
from memory and data is written to memory, through the 
IBIU. 

Cache Control, Memory Management, and Tag Unit 
(RT626) 

The CMTU (RT626) is a combined Cache Controller and 
Memory Management Unit optimized for multiprocessing 
systems. The CMTU is a high-speed CMOS implementation 
of the SPARC Reference MMU, combined with cache, a 
memory controller, and on-chip physical cache tag memory. 


830-0014-00 Rev B 11/21/96 


Colorado 3 RS4026 


The CMTU supports the SPARC MBus Level 2 protocol for 
multiprocessing systems. Figure 3 depicts the CMTU block 
diagram. 


The CMTU directly connects to the RT620C Central Proces- 
sing Unit and RT628 Cache Data Units without any external 
circuitry. The RT626 CMTU uses four or eight RT628 CDUs 
to realize 512-Kbytes or 1-Mbyte, respectively, of zero-wait- 
state, direct-mapped virtual cache memory. 


MMU. The MMU portion of the CMTU provides translation 
from a 32-bit virtual address (4 gigabytes) to 36-bit physical 
address (64 gigabytes), as provided in the SPARC reference 
MMU specification. Virtual addresses are further extended 
with the use of a context register, which is used to identify up 
to 4096 contexts or tasks. The TLB entries contain context 
numbers to identify tasks or processes. This minimizes un- 
necessary TLB entry replacement during task switching. 


The CMTU performs its address translation task by compar- 
ing a virtual address supplied by the RT620C through the In- 
tra-Module Bus to the address tags in the TLB entries. If a 
“hit” occurs, the physical address stored in the TLB is used to 
translate the virtual-to-physical address. If the virtual address 
does not match any valid TLB entry, a “miss” occurs. This 
causes a table walk to be performed by the MMU. The table 
walk is a search performed by the MMU through the address 
translation tables stored in main memory. Upon finding the 
PTE, the MMU translates the address and selects a TLB entry 
for replacement. 


Cache Controller The CMTU’s cache controller supports 
two modes of caching: write-through with no write allocate 
and copy-back with write allocate. The cache is “virtually in- 
dexed” and “physically tagged.” 


In 1-Mbyte secondary cache versions, the cache is organized 
as 16384 lines with two sub-blocks, each of which is 32 bytes. 
Intra-Module Bus address bits IMA[19:6] select the cache 
line, IMA[5] selects the sub-block, and IMA[4:3] select the 
64-bit word of the cache line. In 512-Kbyte secondary cache 
versions, only one sub-block is populated, and IMA[18:5] 
select the cache line and IMA[4:3] select the 64-bit word of 
the cache line. 


The 16384 cache tag entries in the RT626 are virtual address 
indexed. From the processor side, the virtual address on the 
Intra-Module Bus is used to select a cache line entry and its 
corresponding cache tag entry. The translated physical 
address is then compared against the physical address in the 
selected cache tag entry to determine if the required data 
resides in the cache. 


From the MBus side, the superset virtual address bits are con- 
catenated with physical address bits [11:5] to select a cache 
line entry and its corresponding cache tag entry. The physical 
address on MBus is then compared against the physical 
address in the selected cache tag entry to determine if the 
required data resides in the cache. 


A 64-byte write buffer and a 32-byte read buffer are provided 
in the RT626 to fully buffer the transfer of cache lines. This 
feature allows the CMTU to simultaneously read a cache line 
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Figure 3. RT626 CMTU Block Diagram 


from main memory as it flushes a modified cache line from 
the cache. 


MBus. The CMTU supports the SPARC MBus interface stan- 
dard and the SPARC MBus Level 2 cache coherency proto- 
col. It supports data transfers in transaction sizes of 1, 2, 4, 8, 
or 32 bytes. These data transfers are performed in either burst 
or non-burst mode, depending upon the size. Data transac- 
tions larger than 8 bytes are transferred in burst mode. Bus 
mastership is granted and controlled by an external bus arbi- 
ter. 


The CMTU also supports the MBus Module Identifier feature 
of the MBus, in which it accepts the Module Identifier input 
from the MBus and embeds it in the MBus address phase of 
all MBus transactions initiated by the CMTU. 


Cache Data Units (RT628) 


The RT628 is organized as four arrays of 32-Kbytes each. It 
contains one-deep write buffer pipelines, byte write logic, 
registered inputs, data-in and data-out latches, and data for- 
warding logic for the write buffer. 


Writing into the RAM core is delayed until the next write 
access. To allow data forwarding, the CDU incorporates a 
comparator to compare the address of the write-buffer to the 


incoming read address. If a match occurs, data is forwarded 
directly from the write-buffer to satisfy the current read 
cycle. 


For a more complete description of the individual SPARC 
components used in the RS4026, please refer to the ROSS 
SPARC RISC User’s Guide and Appendix B. 


Module Design 
Advanced Packaging Technology 


The RS4026 employs multi-die packaging (MDP) technol- 
ogy to facilitate higher clock frequencies and reliable opera- 
tion. Each MDP component contains a complete hyper- 
SPARC CPU chipset. MDP technology improves electrical 
characteristics by reducing electrical parasitics, allowing 
multiple discrete chips to function as a single monolithic die. 


Clock Distribution 


The RS4026 uses three MBus clock signals (MCLK[O], 
MCLK[1], and MCLK[2]) as defined in the MBus Specifica- 
tion. In order to minimize clock skew, traces have been care- 
fully routed. All clock lines are routed on inner layers of the 
module PCB, and their lengths and impedances are matched. 
The MBus clock lines have diode termination to reduce sig- 
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Figure 4. RT628 Block Diagram 


nal undershoot and overshoot, and all intramodule clock lines 
use a parallel resistive termination of 602. 


MBus Connector (Module) 


The RS4026 interface is via the 100-pin SPARC MBus 
connector, which is a two-row male connector with 0.050” 
spacing (AMP part number 121354-4 or Fujitsu part number 
FCN-264P100-G/C). The connector is a controlled impe- 
dance-type (50Q +10%) based on a microstrip configuration 
that provides a controlled characteristic impedance plus very 
low inductance and capacitance. Separate power and ground 
blades are provided for isolation to prevent noise transfer- 
ence. Table 1 details the RS4026 standard connector pinout. 
This MBus connector supports Level 2 MBus. 


Mating MBus Connector (System Interface Board) 


The module connects to the system interface through the 
standard MBus female connector (vertical receptacle 
assembly, AMP part number 121340-4 or Fujitsu 
FCN-264J100-G/0). 


Reset and Interrupt Signals 


A power-on reset signal is generated to the module from the 
MBus via the RSTIN signal. Each CPU has its own direct set 


of interrupt lines. Level sensitive interrupts (15 max) are 
generated to each RT620 via the MIRLO[3:0] and 
MIRL1[3:0] lines from the MBus. A value of 0000b means 
that there is no interrupt, while a value of 1111b means an 
NMI (Non-Maskable Interrupt) is being asserted. MIRL 
values between | and 14 represent interrupt requests that can 
be masked by the processor. 


MBus Request and Grant Signals 
Two separate sets of request and grant signals (MBR[O] and 


MBG[0] for CPU0, MBR[1] and MBG[1] for CPU1) are gen- 
erated to/from the RS4026 modules to arbitration logic on the 


motherboard. 
MBus SCAN Test Feature 


The RS4026 module also supports the Boundary SCAN test 
feature of the MBus. For more details on the SCAN test, 
please refer to the ROSS SPARC RISC User’s Guide and 
SPARC MBus Interface Specification. 

MID Lines 


Each CPU on the RS4026 has a dedicated set of MID[3:0] 
lines. To ensure that each CPU has a unique MID number the 
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module ties CPUO (Channel 0) MID[O] to ground and CPU1 
(Channel 1) MID[O] to Vcc. 


Table 1. MBus Connector Pinout !!! 


Signal Name Signal Name Pin # | Signal Name Signal Name 
1 TDI TMS 









































Blade #1 2 MCLK[?2] Ground MERR 
TDO Ground TRST MCLK[3] MAS 
TCLK MIRLO[1] MBR{1] Ground MBB 
MIRLO[0] Ground MIRLO[3] MBG[1] RSVDO 
MIRLO[2] MAD[32] MAD{[33] 
MAD[0] Ground MAD{[34] Blade #4 MAD{35] 
MAD[2] MAD[36] Vcc MAD{[37] 
MADF[4] Ground MAD[38] MAD{[39] 
MAD{[6] MAD[40] Vcc MAD/41] 
MAD{[8] MAD[42] MAD[43 
MAD[10] | Blade #2 MAD[44] Vcc MAD[45] 
MAD[12 Vcc ] MAD[46] MAD[47] 
MAD[14] ] MAD[48] Vec MAD/[49] 
MAD[16] Vcc [17] MAD[50] MAD{[51] 
MAD[18] ] MAD[52] MAD{[53] 
MAD[20] Vcc ] MAD[S54] Blade #5 MAD[S55 
MAD[22] ] MAD[56] Ground MAD[57] 
MAD[24] Vcc ] MAD{58] MAD{[59] 
MAD[26] 7] MAD{[60] Ground MAD{[61] 
MAD[28] MAD[62] MAD[63] 
MADJ30] Blade #3 7] RSVD1 Ground MIRL1[0] 
MBR[0] Ground MIRL1[1] MIRL1[2] 
MBG[0] MIRL1[3] Ground AERR 
MCLK[0] Ground RSTIN MID[1] 
MCLK{[1] MID[2] MID[3] 
Notes: 
1. RES and RSVD pins are not used in the RS4026 but reserved for other for the assignment of these reserved pins per the SPARC MBus Specifica- 
MBus module upgrades. See the System Design Considerations section tion. 
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Absolute Maximum Ratings |”! 
(Provided as guidelines; not tested.) 


Parameter Description 


Maximum Power Consumption 


Storage Relative Humidity [°1 
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Supply Voltage Range —0.5 to +7.0 hea | 


Input Voltage Range —0.3 to +7.0 | Vv | 
Storage Temperature —20 to +75 ee. 


Vv 
sw 
A 
Vv 
2c 
% 


Parameter Description 


CC 
IH 


Input LOW Voltage 


Supply Voltage 
Input HIGH Voltage 


Output LOW Voltage Vcc = Min., Io, = 8.0 mA 
Input Leakage Current (non-clock pins) Vcc = Max., Vss < Vout < Vcc 


Output Leakage Current Vcc = Max., Vss < Vout < Vcc - 


= < < -l 
Input Leakage Current (clock pins) Vcc = Max., Vss < Vout < Vcc —-4 
= < < 1 


Output Short Circuit Current BD! 


Input Capacitance 


Vcc = Max., Vout = 0V 


Operating Ambient Air Temperature [6 
Operating Relative Humidity 1°] 


Output Capacitance 
Input/Output Capacitance 
Clock Input Capacitance 





Notes: 


2. 


al 


All power and ground pins must be connected to other pins of the same 
type before any power is applied to the RS4026. At least three clock 
cycles must be applied to set up the internal chip drivers properly. 
Non-condensing. Maximum rate of change of 30% per hour. 
Recommended use of this module does not include “‘hot-socketing” or 
“live-insertion” (i.e., it is not recommended that the RS4026 be placed in 
the MBus socket with the power supply on). 


Not more than one output should be tested at one time. Duration of the 
short circuit should not be more than one second. 

See Appendix A. hyperSPARC Module Thermal Specifications. This 
temperature should not be exceeded when device is consuming maxi- 
mum power with 300 linear feet per minute (LFM) airflow at sea level. 
Tested initially and after any design or process changes that may affect 
these parameters. 
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AC Electrical Characteristics Over the Operating Range |8.9! 


Param 


Description 


Synchronous signals [10] 


t 


CG 


~ 


tPpWH 
tPpWL 
tcsR 
tskU 
twopD 
twoH 
tmIs 
twIH 
tcop 
tcoH 
tcIs 
tcIH 
tpop 
tPoH 
tPIs 


tPIH 


Asynchronous Signals 


tRsT 


Notes: 


8. Test conditions assume signal transition times of 3 ns or less, a timing 10. 





MBus Reset Duration [!?1 





reference level of 1.5V, input levels of 0 to 3.0V, and output loading of 
120-pF capacitance, not including the module itself (with the excep- 11 


tion of MBus point-to-point 
loading of 40 pF). 


control signals, tested with an output 49 


9. All measurements made at MBus connector. 


MBaPonin Ron Comatipasectn |S 
x 
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Mewcoxpml TO 
Mewcoaninpms PT 
MewcCoatoneot | 


n n n n n n n n n n 5 


a 


All timing parameters are relative to one of the two processors (e.g., 
tmop is guaranteed relative to MCLK[0]). 


. Measured between any two MCLK signals. 
. This is the minimum time for which RSTIN must be asserted after 





both high and low power supply voltages are stable. 
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MBus Timing Diagrams 
Single Read Transaction 


CLK 





MAD[63:0] 


MRDY 


Single Write Transaction 
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System Design Considerations 


The RS4026 implements all but one of the possible MBus 
signals. The MBus connector, per the SPARC MBus Specifi- 
cation, assigns the optional signal INTOUT to Pin 10. 
Although this signal is not used on the RS4026, systems de- 
signers should be aware of this assignment to preserve com- 
patibility with other MBus modules. 


Table 2. Pins Reserved on RS4026 


58* RSVDO 
91* RSVD1 


* Non-floating. Reserved for ROSS internal use only. 
These signals should not be driven. 





All MAD, bused control, and point to point control signals 
use 8-mA drivers. The MSH and AERR signals use open- 
drain drivers. 


10-Kohm pull up resistors are required on MAS, MRDY, 
MRTY, MERR, MBB, and MIH. A 1.5-Kohm pull up resistor 
is recommended on AERR. A 619 ohm pull up resistor is rec- 
ommended on MSH. MAD signals require holding amplifi- 
ers. 
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In order to assure that all module scan circuitry is initialized 
to the normal operating state on reset, the following is recom- 
mended for the MBus scan signals. TDI and TMS should be 
pulled up to 5V with 10KQ resistors. TCLK must toggle at 
least 3 full cycles while TRST is asserted in order to reset all 
scan circuitry. RSTIN may be driven by the module when the 
module is in scan mode, so it should be buffered from the rest 
of the system. These requirements may be met by connecting 
each signal as shown in Figure 5. The RS4026 uses MCLK[0] 
to clock processor 0, MCLK[2] to clock processor 1, and 
MCLK[1] for test. 


As the frequency of operation increases, transmission line ef- 
fects play a bigger role. Care must be taken to keep skew be- 
tween any two clock signals at the MBus connector within the 
specifications given in the Synchronous Signals table in the 
AC Characteristics section. MBus signal lines must be routed 
carefully to minimize crosstalk and interference. A thorough 
SPICE analysis of the motherboard design is recommended. 


Use of HH Smith #4387 (3/4” length by 1/4” OD) stand-offs 
or equivalent is recommended on the motherboard to support 
the module and prevent damage to the connector. 








If mounting screws are used, nylon screws are recommended 
to prevent over-torquing and damage to the PCB. 


RS4026 





Figure 5. Scan Pin Connections 
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RS4026 Mechanical Drawing !13.14. 15] 





Top View 
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Figure 6. RS4026 Mechanical Dimensions 


Notes: 

13. Drawing is for reference only. Appearance of module is subject to 15. To ensure compliance with all future MBus modules, systems develop- 
change without notice. ers should design to the MBus module envelope per the SPARC MBus 

14. Drawing is not to scale. All dimensions are in inches (mm). Specification. 
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Module Serial Number *51163269600254* 


RS4026-150/512 03 9625 -17 Rey E 


ROSS Part Number Bill of Materials 


Revision 
Manufacturer Code 





Date Code Bill of Materials 





Figure 7. RS4026 Module Labeling 
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Appendix A. hyperSPARC Module Thermal 
Specifications 


Ambient Temperature 





Ambient temperatures as high as 40°C are acceptable for the 
RS4026 provided airflow is 300 linear feet per minute (LFM) 
minimum through the heatsink fins at all locations indicated 
in Figure 8. In this context, ambient temperature is defined as 
the air temperature in immediate proximity to the module. 


Module airflow measurements must be taken with the 
anemometer probe in front of the fins, approximately 1/2” 
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above the top of the PCB at the indicated locations. The air- 
flow must meet the minimum requirements at all locations 
indicated in Figure 8. When taking airflow measurements the 
module should be installed in a system that is configured in 
the same fashion as the actual final production system (for 
example, all external covers and panels should be installed, 
and any internal ducting or baffling should also be installed). 


Ambient temperature should be measured within the system, 
as it enters the fins of the heatsinks on the module. 


For further information regarding thermal measurements 
contact ROSS Applications Engineering. 


Top View 

















“ 



































Side View 





Figure 8. RS4026 Airflow Measurement 
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Appendix B. Impact of Cache Size on System 
Design. 
Hardware Differences 


hyperSPARC modules with 256-Kbytes of secondary cache 
utilize the RT625 Cache Controller, Memory Management, 
and Tag Unit (CMTU) and four RT627 Cache Data Units 
(CDUs). Modules with 512-Kbytes or 1-Mbytes of secondary 
cache utilize the RT626 CMTU and either four or eight 
RT628 CDUs. 


CDU. The RT628 is functionally equivalent to the RT627 
described in the ROSS SPARC RISC User’s Guide except that 
the RT628 is based on a 32-Kbyte x 32-bit SRAM core. 


CMTU. The RT626 is functionally equivalent to the RT625 
described in the ROSS SPARC RISC User’s Guide, with the 
few exceptions outlined below. 


The RT626 supports two cache sizes: 512-Kbyte and 
1-Mbyte. Cache lines are directly addressed by the RT620 
CPU with the Intra-Module Address Bus (IMA[31:0]). The 
512-Kbyte cache is organized into 16384 lines of 32 bytes 
each. IMA[18:5] select the cache line, and IMA[4:3] select 
the 64-bit word of the cache line, as illustrated in Figure 9. 
The 1-Mbyte cache is organized into 16384 lines with two 
sub-blocks, each sub-block being 32 bytes. Address bits 
IMA[19:6] select the cache line, address bit IMA[5] selects 
the sub-block, and address bits IMA[4:3] select the 64-bit 
word of the cache line, as illustrated in Figure 10. 





IMA[31:0] IMA[4:3] 





Intra-Module Data Bus 


830-0014-00 Rev B 11/21/96 


Colorado 3 RS4026 


The RT626 cache tag array consists of 16384 direct-mapped 
physical address cache tag entries. The layout of the cache 
tag entries is identical to that of the RT625. The 16384 CTAG 
entries are virtual address indexed. 


From the processor side, the cache line select field, 
IMA[18:5] in the case of the 512-Kbyte cache or IMA[19:6] 
in the case of the 1-Mbyte cache, is used to select a cache line 
entry and its corresponding cache tag entry. 


From the MBus side, the index field for CTAG, as supplied 
by the MBus, is formed by concatenating the superset virtual 
address bits [18:12] (MAD[52:46]) in the case of 512-Kbyte 
cache or [19:12] (MAD[53:46]) in the case of 1-Mbyte cache 
with physical address bits [11:5] (MAD[11:5]) as shown in 
Figure 9 and 10. 


The System Control Register (SCR) is the same in both the 
RT625 and RT626, except for the Cache Size bit (SCR[12]). 
In the RT626, CS=0 indicates a 512-Kbyte cache subsystem, 
and CS=1 indicates a 1-Mbyte cache subsystem. 


RT626 CTAG entries may be accessed using word LDST 
Alternate instructions with the cache tag entry address and 
ASI=Ox0E. Each tag entry can be read as a Load single or can 
be written as a Store single by the RT620. The address map- 
ping for the Cache Tag entries is shown in Table 3. 
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Figure 9. 512-KByte Cache Memory Subsystem 
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Figure 10. 1-Mbyte Cache Memory Subsystem 
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Figure 11. RT626 Cache TAG (CTAG) Comparison (512-Kbyte Cache) 
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Figure 12. RT626 Cache TAG (CTAG) Comparison (1-Mbyte Cache) 


Software Differences 

OBP. The Open Boot PROM (OBP) is responsible for deter- 
mining the size of the cache, setting the Cache Size bit in the 
System Control Register (SCR), and passing cache line size 
and number of cache lines on to the operating system kernel. 
The cache size may be hard coded, or it may be determined 
dynamically by scanning for the largest byte stored using the 
CACHE DATA ASI (OxF), as shown in the following pseudo- 
code segment: 


set (2*1024*1024), %00 =! 2-Mbyte cache 
sta %00, [%o0JASICACHE_ DATA 

set (1*1024*1024), %00 =‘! 1-Mbyte cache 
sta %00, [%o0JASI_CACHE_DATA 

set (512*1024), %00 =! 512-Kbyte cache 
sta %00, [%o0JASI_CACHE_DATA 

set (256*1024), %00 —‘! 256-Kbyte cache 
sta %00, [%o0JASI_CACHE_DATA 

set (128*1024), %00 =! 128-Kbyte cache 
sta %00, [%o0JASI_CACHE_DATA 

Ida [%g0]ASICACHE_DATA, %ol 

The register %o1 will contain the size of the installed cache, 


since addresses which are larger than the installed cache wrap 
around to virtual index 0. 


When flushing cache lines, a stride of 32-bytes will always 
work. For 256-Kbyte and 1-Mbyte cache sizes, it is more effi- 
cient for software to use a stride of 64-bytes, since both sub- 
blocks will be flushed. The OBP must use the smallest stride 
size supported by all CPUs and caches detected in the system. 


Cache sizes larger than 256-Kbytes are supported in OBPs 
available from ROSS (Rev 2.22.1H or later) or SunSoft (Rev 
2.25 or later). For more information, contact ROSS Applica- 
tions Engineering. 

Operating System. Generally, no operating system modifi- 
cations should be necessary due to changes in secondary 
cache size. Solaris 2.4 and earlier, however, contains a bug 
which limits operation to systems with cache sizes of 
256-Kbytes or less. The bug causes a data area to be overwrit- 
ten when the cache size exceeds 256-Kbytes. In all standard 
Solaris 2.4 kernels investigated by Sun, the data area over- 
written is not used, so the bug is not evident. It may be pos- 
sible, however, that a non-standard kernel with a different 
data layout may experience problems. This bug has been cor- 
rected in Solaris 2.5 and later. For more information, contact 
ROSS Applications Engineering. 
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Table 3. Cache Tag Entry Address Mapping 
512-Kbyte 


0002x H 00004x H 


0006x H 0000cx H bee | 





=e] 
ee ee ee ee 
[ae re 


(X = Don’t Care) 


17 


























SS Colorado 3 RS4026 


TECHNOLOGY , 


Appendix C. hyperSPARC Ordering Information 


Ordering Code CPU Clock Second-level MBus Clock 
Frequency (MHz) Cache size (per CPU) Frequency (MHz) 





RS4026-150/512 150 512K 


For up-to-date ordering and sales information contact: 


ROSS Technology, Inc. ROSS Technology, Inc., Europe 
5316 Hwy. 290 West Avenue Ernest Solvay, 80 
Austin, Texas 78735-8930 USA 1310 La Hulpe 
Telephone: (800) ROSS-YES Belgium 

(512) 349-3108 Telephone: +32 2 652 1014 
FAX: (512) 436-2471 FAX: +32 2 652 1062 


SPARC is a registered trademark of SPARC International, Inc. 
hyperSPARC is a trademark of SPARC International, Inc., used under permission by ROSS Technology, Inc. 


©rRoss Technology, Inc., 1994. The information contained herein is subject to change without notice. ROSS Technology, Inc. assumes no responsibility for the use of any circuitry other than circuitry embodied 
in a ROSS Technology, Inc. product. Nor does it convey or imply any license under patent or other rights. ROSS Technology does not authorize its products for use as critical components in life-support systems 
where a malfunction or failure of the product may reasonably be expected to result in significant injury to the user. The inclusion of ROSS Technology products in life-support systems applications implies that 
the manufacturer assumes all risk of such use and in so doing indemnifies ROSS Technology against all damages. 





